o 

(N 
>v 



(N 

u 



o 



Using a Skewed Hamming Distance to Speed Up 
Deterministic Local Search 



Dominik Scheder 

Theoretical Computer Science, ETH Zurich 

CH-8092 Zurich, Switzerland 

dschederOinf . ethz . ch 



Abstract. Schoning 13 presents a simple randomized algorithm for 
(d, fc)-CSP problems with running time I ' J~ ' 1 poly(n). Here, d is 
the number of colors, k is the size of the constraints, and n is the num- 
ber of variables. A derandomized version of this, given by Dantsin et 
al. [2], achieves a running time of ( -^j ) poly(n), inferior to Schoning's. 
We come up with a simple modification of the deterministic algorithm, 
>■• ' achieving a running time of ( - - ~ ' • Jd_ 1 ) P°ly( n )- Though not corn- 

er , pletely eleminating the gap, this comes very close to the randomized 

bound for all but very small values of d. Our main idea is to define a 
graph structure on the set of d colors to speed up local search. 



1 Introduction 

r-> 

00 

■^j- ■ Constraint Satisfaction Problems, short CSPs, are a generalization of both bool- 

ean satisfiability and the graph fc-colorability problem. A set of n variables 

f--4 ■ xi, . . . , x n is given, each of which can take a value from [d] :— {1, . . . , d}. The 

f^ \ values 1, . . . , d are sometimes called the colors. Each coloring of the n variables, 

also called assignment, can be represented as an element of [d] n . A literal is an 
expression of the form (xi ^ c) for some c £ [d], A CSP formula consists of a 
conjunction (AND) of constraints, where a constraint is a disjunction (OR) of 

rS | literals. We speak of (d, fc)-CSP formula if each constraint consists of at most 

j^ ■ k literals. Finally, (d, fc)-CSP is the problem of deciding whether a given (d, k)- 

CSP formula has a satisfying assignment. 

In 1999, Uwe Schoning [13 came up with an extremely simple and elegant al- 
gorithm for (d, /c)-CSP: Start with a random assignment. If this does not satisfy 
the formula, pick an arbitrary unsatisfied constraint. From this constraint, pick 
a literal uniformly at random, and assign to its underlying variable a new value, 
again randomly. Repeat this reassignment step 0(n) times, where n is the num- 
ber of variables in the formula. If the formula F is satisfiable, we find a satisfying 
assignment with probability at least (k/(d(k — l)))"/poly(n). By repeating this 
procedure (d(k — l)/fc) ra poly(n) times, we obtain an exponential Monte Carlo 
algorithm for (d, fc)-CSP which we will call Schoning. Not long afterwards, in 
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2002, Dantsin, Goerdt, Hirsch, Kannan, Kleinberg, Papadimitriou, Raghavan 
and Schoning 2 designed a deterministic algorithm based on deterministic local 
search and covering codes. This algorithm, henceforth called det-search, can 
be seen as an attempt to derandomize Schoning's random walk algorithm (ac- 
tually the authors cover only the case d = 2, but everything nicely generalizes 
to higher d). I say attempt because its running time of (dk/(k + l))"/poly(n) is 
worse than that of Schoning. 

Consider the following variant of det-search: Suppose F is a (d, fc)-CSP 
formula on n variables, with d = 2 l being a power of 2. Replacing every d-ary 
variable by £ boolean variables, we transform F into a (2,£k)-CSP formula F' 
over In variables. We solve F' using the original algorithm det-search for the 
boolean case. A quick calculation shows that this already improves over the run- 
ning time of (<ifc/(fc+l))™/poly(n). This observation motivates a more systematic 
exploration of possible ways to speed up det-search. The main contribution of 
this paper is a modified det-search algorithm, which achieves a significantly 
better running time. Both Schoning and det-search work by locally exploring 
the Hamming graph on [d] n , in which two assignments are connected by an edge 
if they differ on exactly one variable. We define a graph G on the set {1, . . . , d} 
of colors, thus obtaining a different, sparser graph on [d] n , the n-fold Cartesian 
product G Dn : Two assignments are connected by an edge if they differ on exactly 
one variable, and on that variable, the two respective colors are connected by 
an edge in G. With G = Kd, this is the Hamming graph on [d] n . Taking G to 
be the directed cycle on d vertices, it turns out that our modified deterministic 
algorithm has a running time of 



d(k - 1) k d 



k 



k d -1 



poly(n) 



For d > 3, this is significantly better than det-search and comes very close to 
Schoning except if d is very small (in particular, we do not improve the case 
d = 2). We hope that future research will eventually lead to a complete deran- 
domization. We compare running times for some values of d and k (ignoring 
polynomial factors in n): 



(d,k) 


Schoning 


det-search 


this paper 


(2,3) 


1.334" 


1.5" 


1.5" 


(3,3) 


2™ 


2.25" 


2.077™ 


(5,4) 


3.75™ 


4™ 


3.754™ 



The case of (2, /c)-CSP, commonly called fc-SAT, has drawn most attention, 
in particular 3-SAT. For 3-SAT, Schoning and det-search achieve a running 
time of 0(1.334") and 0(1.5"), respectively. This is still very close to the cur- 
rent records: By combining Schoning with a randomized algorithm by Paturi, 
Pudlak, Saks, and Zane [9], Iwama and Tamaki [7] achieved a running time of 
0(1.3238"). Later, Rolf [IT] improved the analysis of their algorithm to obtain 
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the currently best bound of 0(1.32216"). The algorithm det-search has been 
improved as well, hrst to 0(1.481") by the same authors, then to 0(1.473") 
by Brueggemann and Kern |T, and finally to the currently best deterministic 
bound of 0(1.465") by myself [T2]. Though we do not improve the case d = 2 in 
this paper, we hope that better understanding of general (d, fc)-CSP will lead to 
better algorithms for fc-SAT, as well. 

Another fairly well-investigated case is k — 2 with d being large. In 2002, 
Feder and Motwani 3 adapted a randomized fc-SAT algorithm by Paturi, Pudlak 
and Zane 10 to (d, 2)-CSP, obtaining a running time of (cdd) n , with Cd converg- 
ing to e" 1 as d grows. We see that the base of the exponential term is proportial 
to d. A certain growth of the base with d seems inevitable: Recently, Traxler [14] 
showed that an algorithm solving (d, 2)-CSP in time a", with a being indepen- 
dent of d, could be used to solve fc-SAT in subexponential time, i.e., 2°^ n \ This 
would contradict the exponential time hypothesis [6 . 

Organization of this paper 

In Section [2] we describe Schoning and det-search, and analyze the running 
time of the latter. Although most of the material of Section[2]is from [13] and [2], 
we chose to present it here in order to make the paper self-contained. In Section[3l 
we define a graph structure on the set of colors, which changes the notion of 
distance on the set [d] n . Taking this graph to be the directed cycle on d vertices 
yields a significant improvement. In In Section [4j we show that choosing this 
graph is optimal. 

2 The Algorithms Schoning and det-search 

Schoning's algorithm works as follows. We start with a random assignment, 
and for 0(n) steps randomly correct it locally. By this we mean choosing an 
arbitrary non-satisfied constraint C, then choosing a literal (x ^ c) 4— u . a , r . C 
(where 4— u . a . r . means choosing something uniformly at random), and randomly 
re-coloring x with some d 4— u . a . r . [d] \ {c}. 

Theorem 2.1 ( [13j ) . If F is a satisfiable (d,k)-CSP formula on n variables, 
then One-Schoning-Run returns a satisfying assignment with probability at least 

fd{k-\ 



\ k J poly(n) 

By repeating One-Schoning-Run (rf(fc — l)//c))"poly(n), times, we find a sat- 
isfying assignment with high probability. This yields a randomized Monte-Carlo 
algorithm of running time (d(k — l)/fc))"poly(n), which we call Schoning. 

Let us describe the algorithm det-search from [2 . We define a parametrized 
version of CSP, called Ball-CSP. Given a (d, fc)-CSP formula F, an assignment 
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Algorithm 1 One-Schoning-Run^: a (d, fc)-CSP) formula 
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a «- u .a.r. [d] n 

for i — 1, . . . ,cn do 

/ / c is a constant depending on d and k, but not on n 

if a satisfies F then 
return a 

else 

C <— any constraint of F unsatisfied by a 
(x t^ c) <— u.a.r. C J J a random literal from C 

c <— u.a.r. [rf] \ {c} // choose a new color for x 

a •(— a[x :— c] II change the coloring a 

end if 
end for 
return unsatisf iable 



a G [d] n and an r £ No, does there exist a satisfying assignment /3 such that 
dn{a,(i) < r? Here, 

d H (a, /3) = \{1 < % < n | a(x 4 ) # /3(x 4 )}| 

is the Hamming distance, and 

SW(a):={/3e[d]"|d ff (a,/3)<r} 

is the Hamming ball of radius r around a. In other words, Ball-CSP asks 
whether B r (a) contains a satisfying assignment. The algorithm searchball 
solves it in time (k(d— l)) r poly(n). To show correctness, suppose the ball Br (a) 

Algorithm 2 searchball (CSP formula F, assignment a, radius r) 
T 
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if a satisfies F then 

return true 
else if r = then 

return false 
else 

C -«— any constraint of F unsatisfied by a 
for (x^c) eC do 
for c' € [d] \ c do 
a' -«— a[x := c] 
if searchball (F, a', r — 1) = true then 

return true 
end if 
end for 
end for 
return false 
end if 



contains a satisfying assignment j3, and let C be a constraint not satisfied by 
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a. At least one literal [x ^ c) E C is satisfied by /3, and in one iteration of 
the inner for-loop, the algorithm will change the assignment a to a' such that 
a'(x) = ft(x), and therefore dn{ct' ,/?) = djj(a,/3) — 1 < r — 1, and at least one 
recursive call will be successful. The running time of this algorithm is easily seen 
to be at most (k(d — l)) r poly(n), as each call causes at most k(d — 1) recursive 
calls (see Figure Q] for an illustration), and takes a polynomial number of steps 
itself. 



(a; ^ 1) V (y # 2) V (z ? 2) 




Fig. 1. searchball branching on a constraint of a (3, 3)-CSP formula. 



Covering Codes 

How can we turn this algorithm into an algorithm for searching [d] n for a satis- 
fying assignment? Suppose somebody gives us a set C C [d] n such that 

aec 

i.e. a code of covering radius r. By calling searchball(i ? , a,r) for each a £ C, 
we can decide whether [d] n contains a satisfying assignment for F in time 

|C|(fc(d-l)r P oly(n). (1) 

By symmetry of the cube [d] n , the cardinality of Br (a) does not depend on 
a, and we define voh '(n, r) := \B r {a)\- The following lemma gives bounds on 
the size of a covering code C. 

Lemma 2.2 (|2j). For all n,d,r, every code C of covering radius r has at least 
— 774U — r elements. Furthermore, there is such a C with 

vol^J (n,r) ' 

\ c \ < —n^ — P ol yM , 

~ vol (d) (n,r) 
and furthermore, C can be constructed deterministically in time |C|poly(n). 
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This lemma, together with (JT]), yields a deterministic algorithm solving (d,k)- 
CSP in time — r^ — Ak(d — l)) r poly(n), and we are free to choose r. At this 

point, Dantsinetal. use the estimate vol (2) (n,r) = ^[ =0 (") > 2 nH( - r ^ /poly(n), 
where H{x) is the binary entropy function (see Mac Williams, Sloane [8], Chapter 
10, Corollary 9, for example), but we prefer to derive the bounds we need our- 
selves, first because the calculations involved are simpler, and second because our 
method easily generalizes to the volume of more complicated balls we will define 
in the next section. We use generating functions, which are a well-established 
tool for determining the asymptotic growth of certain numbers (cf. the book 
generating functionology j!5j). 

Lemma 2.3. For any ri,d £ N and x > 0, there is an r S {0, 1, . . . , n} such 
that 

vol^(n,Q>^-( 1 + ^- 1 ^". 
n + 1 x r 



Proof. We write down the generating function for the sequence ((") (d — l) 1 ) . : 
(l+(d— l)x) n = X)"=o (ij (d—tyx 1 - This sum involves n+1 terms, the maximum 
being attained at i = r for some i £ {0, . . . , n}. Thus (1 + (d — \)x) n < (n + 
l)(")(d — l) r x r . Using vol^ (n,r) < (")(d — l) r and re-arranging terms yields 
the claimed bound. □ 

Theorem 2.4. There is a deterministic algorithm solving (d,k)-CSP in time 

&) poiy(n). 

Proof. Choose x := (k(d — l))^ 1 and apply Lemma [2.31 The lemma gives us 
some r £ {0, . . . , n}. With this radius, the running time in (JIJ is at most 

"'" -(k(d - l))''poly(n) = ^^(fc(^l)r poly(n) 



vol (d) (n,r) (l + (d-l)x) n 

poly(n) = ( — — - I poly(n) 



l + ( d - l )k(tT)) Vfe + 1 



Let us summarize the algorithm det-search: It first constructs a code of appro- 
priate covering radius, then calls searchball for every element in the code. Its 
running time is larger than that of Schoning, since dk/(k + 1) > d(k — l)/&. 

3 G-Distance, G-Balls, and G-searchball 

Let [d] be the set of colors, and let G = ([d],E) be a (possibly directed) graph. 
For two colors c,c', we denote by dG{c,c') the length of a shortest path from 
c to c' in G. If G is directed, this is not necessarily a metric, and therefore we 
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rather call it a distance function. It gives rise to a distance function on [d] n : For 
two assignments ct,/3£ [d] n , we define 

d G (a,f3)=Y,d G (a l ,(3 l ) . (2) 



This is the shortest-path distance on the n-fold Cartesian product G . This 
distance induces the notion of balls B r '(a) :— {/3 £ [d] n \ da(a,/3) < r}, and 
of dual balls {(3 £ [d] n \ dG((3,a) < r}. If G is undirected, balls and dual balls 
coincide, and for G being Kd, the complete undirected graph, da is simply the 
Hamming distance. If G is vertex-transitive (and possibly directed), the cardi- 
nality \B r (a) | does not depend on a, and we define voh (n,r) := \B r (a)|. 
By double-counting, this is also the cardinality of dual balls. In particular, a 
vertex-transitive graph is regular. Let 5 denote the number of edges leaving each 
vertex in G. As before, we define a parametrized problem: Given F, a and r, 
does Br ' (a) contain a satisfying assignment? Algorithm [31 almost identical to 
Algorithm^ solves this problem in time (<5fc) r poly(n). 



Algorithm 3 G-searchball(CSP formula F, assignment a, radius r) 
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if q satisfies F then 

return true 
else if r = then 

return false 
else 

C <— any constraint of F unsatisfied by a 
for (x^c) eC do 

for all c such that (c, c) £ E(G) do 
a -«— a[x := c] 
if searchball(F, a',r — 1) = true then 

return true 
end if 
end for 
end for 
return false 
end if 



3.1 Covering Codes, Again 

Using the G-distance function instead of the Hamming distance also induces the 
notion of covering codes. As before, C C [d] n is a code of covering G-radius r if 



\jB r G \a) = {dr 



a£C 
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The following lemma generalizes Lemma [2.2l to arbitrary vertex-transitive graphs 
G on d vertices. The proof does not introduce any new ideas, and can be found 
in the appendix. 

Lemma 3.1. Let d > 2, and let G be a vertex transitive graph on d vertices. 
For all n and < r < n, every code C of covering G-radius r has at least 
d n /vo\ (n,r) elements. Furthermore, there is such a C with 

\d] n , , s 

\C\ < —^- -poly(n) , 

voi v '(n,r) 

and C can be constructed deterministically in time |C|poly(n). 

By calling G-searchball(F, a, r) for each a E C, we can solve (d, fc)-CSP deter- 
ministically in time 

''" Wpoly(n) , (3) 



vol (G) (n,r) 

where we are free to choose any vertex-transitive graph G and any radius r. Let 
us reflect over ^ for a minute. Taking a graph with many edges results in balls 
of greater volume, meaning a smaller C but spending more time searching each 
ball. Taking G to be rather sparse has the opposite effect. What is the optimal 
graph G and the optimal radius r? 



3.2 Directed Cycles 

Let us analyze the algorithm using G — Cd, the directed cycle on d vertices. 
Clearly, S — 1, and therefore G-searchball runs in time k r . This is as fast as 
we can expect for any strongly connected graph. What is vol^ d '(n, r)l 

Lemma 3.2. For any n,d E N, and x > 0, there is an r 6 {0, . .., (d — l)n} 

such that 

(l + x + --- + x d - 1 ) n 1 



vol (Cd) (n, r) > 



poly(n) 



Proof. Define T(n,s) := vob d '{n, s) — vor d '{n, s — 1). This is the number 
of assignments having distance exactly s from a fixed assignment a. Also, it is 
the number of vectors a E {0, . . . , d — 1}" with X)"=i a « = s - Writing down its 

generating function, we see that (1 + x -\ + x d_1 ) n = X)l=o ^( n > s)x s . For 

some r E {0, 1, . . . , (d — l)n + 1} that maximizes T(n, r)x r , we obtain 

(d-l)n 

{l + x + --- + x d - 1 ) n = Y, T(n,s)x s <{{d-l)n + l)T(n,r)x r 

s=0 

Solving fo T(n, r) proves the lemma. □ 
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We apply this lemma for x = 4 and obtain a certain radius r, for which we 
construct a code C of covering G-radius r. Combining Lemma 13.21 with ([3"Th we 
obtain a running time of 



(l + X-\ Yx d - l ) n \ k k d -l 

and we have proven our main theorem. 

Theorem 3.3. For all d and k, there is a deterministic algorithm solving (d, k)- 
CSP in time 

'rf (fc-l) k d \ n 

I— ■ yr—i) poly(n) ■ 

There is one issue we have consistently been sweeping under the rug. We proved 
Lemma 12.31 and Lemma 13.21 but never addressed the question what radius r 
fulfills the stated bound. For the analysis this does not matter, since r cancels 
out nicely. However, if we were to implement the algorithm, we would have to 
choose the right radius. This is not difficult: In Lemma T3.21 the correct r is the 
one maximizing T(n 1 r)x r , and T[n,r) can be computed quickly using dynamic 
programming. 

4 Optimality of the Directed Cycle 

We will show that our analysis cannot be improved by choosing a different vertex- 
transitive graph G or a different radius r. We ignore graphs that are not vertex- 
transitive because we have no idea on how to upper bounding the running time 
of G-searchball, not to speak of estimating the size of a good covering code. 

Let G be a vertex-transitive graph on d vertices. For some vertex u £ V(G), 
we denote by di the number of vertices v £ V{G) having da(u, v) — i. Since G 
is finite, the sequence d ,di, . . . , eventually becomes 0. Denoting the diameter 
of G by s, it holds that di = for all % > s. If G is connected (which we do not 
necessarily assume), the di add up to d. Since G is vertex-transitive, the di do not 
depend on the vertex u. Clearly, G is di-regular, and G-searchball runs in time 
(dik) r po\y(n) on a (d, fc)-CSP formula. How do we estimate vor '(n, r)l Again 
we define T{n,r) — vol '(n,r) — vop '(n,r — 1), i.e., the number of elements 
having distance exactly r from some fixed a. The Ta{n,r) obey the recurrence 

s 

T(n, r) = 2_. d%T{n — 1, r — i) . 

i=0 

This is easy to see: Fix a £ [d] n . How many (3 are there such that dc{a, ft) = rl 
Consider the first coordinates a^ and /?i. If dc(oti,/3i) — i, then there are di 
ways to choose /?i , and the distances at the remaining n — 1 positions add up to 
r — i. Some moments of thought reveal the following identity: 

Cs \ n sn 

j>s*) =Y f T(n,i)x i 
i=0 / 2 = 



X 

Before, we were interested in bounding vor ' (n, r) from below. Now we want 
to bound it from above, because we want to argue that any code C C [d] n of 
covering radius r must necessarily be large, and the algorithm must be slow. 

Lemma 4.1. For any n 6 N, r G {0, 1, . . . , sn} and any x G [0, 1], it holds that 

vo i(% jr) <CkM!. 

x r 

Proof. (E£=o**T = ESo^Ms* > EU^iV > E[=o T (M* r = 

x r vor '(n, r), and for the last inequality we needed that x G [0, 1], thus x l > x r 
for i < r. Re-arranging terms yield the claimed bound. □ 

Clearly any code C C [d] n with U Q eC ^ ( a ) = M" mus t satisfy 

rf" 



ICI > 



vol (G) (n,r) 
Since G-searchball takes time (kdi) r , the total running time is at least 

dn -( kdl y> dnxr{kd ^ r 



vol^(n,r) v LJ -(ElodixT' 

where this inequality holds for all choices of x. Setting x = 7-7-, we see that the 
running time is at least 

d n 

(EU^-'drT ' 

In a di-regular graph, the number of vertices at distance i from u can be at most 
d\. In other words, d, < d\, and the above expression is at least 

d" 



which, up to a polynomial factor, is the same as what we get for the directed 
cycle on d vertices. 

5 Conclusion and Open Problems 

We can apply the same idea to Schoning's algorithm: When picking a literal 
{x ^ c) uniformly at random from an unsatisfied constraint of F (see line 8 of 
One-Schoning-Run), we choose a new truth value c' uniformly at random from 
the set {d G [d] \ (c,d) G E(G)}. With G = Kd, this is the original algorithm 
Schoning, and surprisingly, for G being the directed cycle, one obtains exactly 
the same running time (d(k — l)/fc)"poly(n). Since the analysis of this modified 
Schoning does not introduce any new ideas, we refer the reader to the appendix 
and to Andrei Giurgiu's Master's Thesis 4 , which presents a general framework 
for analyzing random walk algorithms for SAT. Our main open problem is the 
following. 
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For which graph on d vertices does the modified One-Schoning-Run 
achieve its optimal success probability? 

If we had to, we would guess that no graph can improve Schoning's algorithm. 
Intuitively, it does not make sense to restrict the random choices the algorithm 
can make, because we have no further information on which choice might be 
correct. In the deterministic case, where every branch is fully searched, it seems 
to make more sense to restrict the choices of the algorithm, since this yields an 
immediate reduction in the running time of G-searchball. 
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A Schoning's Algorithm With Directed Cycles 

Can we apply the same idea to Schoning's algorithm? When picking a literal 
(x ^ c) uniformly at random from an unsatisfied constraint of F (see line 8 of 
One-Schoning-Run), we choose a new truth value c' uniformly at random from 
[d] \ {c}. We modify this algorithm as follows: Using a graph G with vertex set 
[d], we choose the new color uniformly at random from the set {d € [d] | (c, c') € 
E(G)}. If G = Kd, this is nothing new. What if G is the directed cycle? Let the 
d colors be 0, 1, . . . , d — 1 and let the edges be (i, i + 1) (addition taken modulo 
d). This means that we always change color c to color c + 1. Let /3 be a fixed 
satisfying assignment and a be the current (non-satisfying) assignment in the 
algorithm One-Schoning-Run. If /3 satisfies the literal (x ^ c), i.e. (3(x) ^ c, 
then changing the color of x from c to c + 1 decreases the distance from a to /3 
by 1. Otherwise, if j3(x) = c, then the distance from a to /3 increases by d — 1. 
If C is a constraint involving fc literals and which is unsatisfied by a, then with 
probability at least \ we choose a literal that is satisfied by j3, and decrease the 
distance by 1, and with probability at most ^jr-, we choose a literal not satisfied 
by f3, increasing the distance by d — 1. To analyze the algorithm, we define a 
Markov chain (see Figure [2]): The states of the Markov chain are No U {S}, with 




i ~ 1 3 j + d-1 

Fig. 2. Part of the Markov Chain 



S being a special starting state. The states j £ Nq represent the distance from a 
to some fixed satisfying truth assignment /3. The transition probabilities are as 
follows: For < j < (d— l)n, the probability ps,j of going from S to j is G ^"'^ , 

where Ta(n,j) = vol' (n,j) — voh (n,j — 1) is the number of assignments a 
such that dG(a,(3) — j. When taking a step from S to some j according to the 
transition probabilities, j will be distributed exactly as ^(a, /3) for a 6 u . a .r. [d] n - 
Furthermore, for j > 1, Pjj-i is r> and Pjj+d-i = ^x^") an d Po,o = 1- Here, we 
only sketch analysis of this Markov chain. For details, please see Giurgiu's Master 
Thesis [4]. The probability of One-Schoning-Run finding a satisfying assignment 
is at least the probability of this Markov chain reaching state after at most en 
steps, with c being the constant in line 2 of One-Schoning-Run. As it turns out, 
the probability that we reach in at most en steps, conditioned on the event 
that is reached at all, is rather high. Note that with positive (in fact, quite 
large) probability, we will never reach state 0. Therefore, to analyze the success 
probability of One-Schoning-Run, it suffices to lower bound the probability that 
our random walk eventually reaches 0. Let Pj be the probability that a random 
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walk starting in state j eventually reaches 0. The Pj obey the equation 

Pj = \Pi-i + ^Pj+d-i ■ (4) 

Observe that if some A £ (0, 1) satisfies 

then Pj = A- 7 satisfies (jj). Here we would have to show that $5§ has a unique 
"reasonable" solution for each d, and that A J is in fact the unique solution to 
|@J. We can compute the probability that we eventually reach 0: 

P[0 eventually reached] = V ^lU \i = (i + A + A 2 + ■ ■ ■ + A*" 1 )" , 

3=0 

since (1 + x + x 2 + ■ ■ ■ + a; d_1 )" = J2i=o Ta(n,i)x l . The above expression 

J} X Z\) J ■ From ([5]) we learn that 
X d = ^rEr i an ^ plugging this into the previous expression yields 

A d -1A" f^-^Y 



d(X-l)J \d(X-l)J \d(k-l) 

Now the probability that One-Schoning-Run finds a satisfying assignment is at 
least ( d(k-i) ) o/u i an d if we repeat it ( k ) P°ly( n ) times, we find a 
satisfying assignment with constant probability (if one exists). This is exactly 
the running time of Schdning's algorithm we got before. Hence we see: Running 
Schoning with G being Kd or being the directed cycle makes no difference. 

B Constructing the Covering Code 

We show how to deterministically construct a code CC [d] n of covering radius 
r, i.e., \J aeC Br (a) = [d] n , for G being the directed cycle on d vertices. The 
construction is just a generalization of the one in Dantsin et al. [2]. 

Lemma B.l. Let G be the directed cycle on d vertices. For any n S N and x >, 
there is an r £ {0, . . . , (d — 1)"} such that 

(G) i {\+x+x 2 + ---+x d - i r 

voh '{n,r) > 



(d-l)n + l 

and there is a code C C [d] n of size at most 

[d] n x r 



, > , d-nn P ol y(") 

(1 + x + x + ■ ■ ■ + x a L ) n 

which can be constructed deterministically in time 0(\C\). 
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Proof. The proof idea is as follows: A probabilistic argument shows that a code 
C* of claimed size exists (one obtains C* by sampling random points in [d] n ), 
and then one invokes a greedy polynomial time approximation algorithm for the 
Set Cover problem (see [5], for example). This returns a code of size at most 
|C*|poly(n). The problem is that this instance of Set Cover has a ground set of 
size d n , and d n sets to choose from, thus the approximation algorithm will take 
at least d n steps. As in Dantsin et al. [2], we solve this problem by partitioning 
our n variables into b blocks of length n/b each, where b is a constant, depending 
on d but not n. 

Let us be more formal. We first construct a covering code for [d] n ^ b . By 
Lemma I3T21 we know that for any x > 0, there is an r € {0, . . . , (d— l)n/b} such 

that 

1 (1 + x + ■ ■ ■ + x d - 1 ) n / b 



il {G) {n/b,r)> 



(d- l)n + l 



Using this r, we choose a set C* C [d] n ^ b by randomly sampling n | (G) '. 

elements from [d]™' 6 , uniformly at random with replacement. This is only a 
feature of the proof - the sampling is not part of our deterministic construction. 
For any fixed (3 e [d] n l b , it holds that 



aec 



pp * u ^ g) («)] - ( i - vol(G !y r) l lc ' < e- Md " /6) - d- n,b 



By the union bound, we see that with non-zero probability, no assignment f3 is 
uncovered, and thus there exists a code C* of desired size and covering radius 
r. We construct an instance of Set Cover: The ground set is [d]™/ h , and the set 
system consists of all Br '(a) for a 6 [d]"/ 6 . The deterministic polynomial-time 
approximation algorithm will in time poly (cf l//h ) find a code C C [d] n ^ b of size 
0(\C*\n). We define C C [d] n by C := C b , the 6-fold Cartesian product. It is 
easy to see that 

\jB%\a) = [dr 

and 

\C'\ = \C\ b <^(i + x + ... + x d -Yvo\y(n) b . 

By choosing b large enough, although still constant, we can make sure that the 
running time of the approximation algorithm is at most \C'\. This concludes the 
proof. □ 

Actually the proof works as well for arbitrary vertex-transitive graphs, not only 
directed cycles, but the formulas become uglier. 



